Entropy in the natural time-domain* 
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A surrogate data analysis is presented, which is based on the fluctuations of the "entropy" S 
defined in the natural time-domain [Phys. Rev. E 68, 031106, 2003]. This entropy is not a static 
one as, for example, the Shannon entropy. The analysis is applied to three types of time-series, 
i.e., seismic electric signals, "artificial" noises and electrocardiograms, and "recognizes" the non- 
Markovianity in all these signals. Furthermore, it differentiates the electrocardiograms of healthy 
humans from those of the sudden cardiac death ones. If 8S and SS s huf denote the standard deviation 
when calculating the entropy by means of a time-window sweeping through the original data and 
the "shuffled" (randomized) data, respectively, it seems that the ratio SS s huf /SS plays a key-role. 
The physical meaning of SSshuf is investigated. 

PACS numbers: 05.40.-a, 87.17.-d 



I. INTRODUCTION 



In an electric signal consisting of N pulses, the nat- 
ural time was introduced[l, 2] by ascribing to the fc-th 
pulse the value \k — k/N. The analysis is then made 
in terms of the couple (xfc, Qk) where Qk stands for the 
duration of the fc-th pulse. The entropy S, defined[l, 3] 
as S = (xlnx) - (x) m (x)> where (x) = J2k=iPkXk, 

Pk=Qk/Y.n=\Qn and (xlnx) = J2k=i VkXk In Xfc, wa s 
found[3] to distinguish Seismic Electric Signals (SES) 
activities from artificial noises (AN), where the latter 
terminology stands for electrical disturbances which are 
recorded at a measuring site due to nearby man-made 
electric sources. More precisely, SES activities and AN 
have S- values smaller and larger than that (S u ) of a "uni- 
form" (u) distribution, respectively (as the latter was de- 
fined in Refs. [1, 3, 4]). Furthermore, ion current fluctu- 
ations in membrane channels (ICFMC) have S very close 
to S u [3]. 

The fact that a system contains nonlinear components 
does not necessarily reflect that a specific signal we mea- 
sure from the system also exhibits nonlinear features. 
Thus, before analyzing this signal by applying nonlin- 
ear techniques, we must first clarify if the use of such 
techniques is justified by the data available. The method 
of surrogate data has been extensively used to serve such 
a purpose (see Ref. [5] for a review) . Surrogate data refer 
to data that preserve certain linear statistic properties of 
the experimental data, but are random otherwise [6, 7]. 
These data are prepared by various procedures; for exam- 
ple, Siwy et al.[7] in order to study the nature of dwell- 
time series in ICFMC, among other methods, also used 
surrogate data which have been obtained by three differ- 
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ent procedures. The present paper aims, in general, at 
presenting a kind of surrogate data analysis using the en- 
tropy fluctuations in the natural time-domain (see below) 
as discriminating statistics. Throughout the paper, the 
surrogate data are obtained by shuffling the Qk randomly 
and hence their distribution is conserved. Applying such 
a procedure, we do the following: consider the null hy- 
pothesis that the data consist of independent draws from 
a fixed probability distribution of the dwell times; if we 
find significantly different serial correlations in the data 
and their shuffles, we can reject the hypothesis of inde- 
pendence, see paragraph 3.1 of Rcf. [5]. In other words, 
the tested null hypothesis is that Qk are independent and 
identically distributed (iid) random variables, i.e., that 
there are no correlations between the lengths of consec- 
utive intervals. If the original (continuous) time series 
is Markovian then the null hypothesis for the Qk should 
hold, i.e., the Qk are iid. We emphasize that the termi- 
nology "Markovian" throughout this paper always refers 
to the original time series. 

Here, as a measure of the natural time entropy fluc- 
tuations we consider the standard deviation SS when we 
calculate the value of S for a number of consecutive pulses 
and study how S varies when sweeping this time- window 
through the whole time-series. We use the following three 
data sets: Two of them are those treated in Ref. [3], i.e., 
SES activities and AN. As a third one, we preferred to 
use, instead of ICFMC, the case of electrocardiograms 
(ECG), for several reasons, chief among of which are: (a) 
They are publicly accessible [8] . (b) Instead of the single 
ICFMC example, a large variety of ECG are available 
(i.e., 105 individuals are employed here, 10 healthy and 
95 patients), (c) The case of ECG is similar to ICFMC, 
in the sense that the S'-value in ECG results very close 
to S u as in ICFMC investigated in [3]. Note, however, 
that the intervals between heart beats fluctuate widely, 
e.g., [9]. 

A general agreement about whether or not normal 
heart dynamics are chaotic or not chaotic is still lack- 
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ing (e.g., see Ref.[10] and references therein). The most 
commonly used non-linear complexity measures are frac- 
tal dimensions of various kinds (e.g., correlation dimen- 
sion, Renyi dimensions). Each of them measures different 
aspects of the statistics on the attractor. On the other 
hand, Liapunov exponents and the Kolmogorov-Sinai en- 
tropy (K-S entropy) and entropy rates are measures of 
the dynamics on an attractor. Except for the K-S en- 
tropy and entropy rates, the other categories of complex- 
ity measures assume a purely deterministic system (e.g., 
see Ref.[ll]). Since a physiological time series may be 
due to a mixed process, stochastic and deterministic, the 
use of fractal dimensions in physiological time series has 
been occasionally criticizedfll]. On the other hand, en- 
tropy is a concept equally applicable to deterministic as 
well as stochastic processes. This is why we preferred to 
use the entropy in natural time (more precisely its fluc- 
tuations SS ) as discriminating statistics. The following 
point, however, should be stressed. Complexity measures 
based on static entropy (e.g., Shannon entropy) quantify 
statistical order in the time series. The underlying key- 
property of these complexity measures is the probability 
distribution of the (dwell times in the) data analyzed; 
thus, the result of such computations should be indepen- 
dent of permutations performed on the (sequence of the 
dwell times in the) time series as in a surrogate (random- 
ized) data set obtained by data shuffling. On the other 
hand, the entropy in natural time (and the relevant mea- 
sures) considers, from its definition, the sequential order 
(of beats); in other words, S is a dynamic entropy, i.e., it 
captures characteristics of the dynamics in a system. Ad- 
ditional comments on the importance of the fluctuations 
of S in ECG will be forwarded in Section V. 

In all examples, we use a sliding window of length three 
to ten pulses, except otherwise stated. Concerning the 
symbols: We reserve SS only for the case when the cal- 
culation is made by a single time-window, e.g., 5 pulses. 
The symbol SS denotes the average of the JS'-values cal- 
culated for a sequence of single windows, e.g., 3, 4 and 
5 pulses. Finally, (SS) stands for the 55-values averaged 
over a group of individuals, e.g., the 10 healthy subjects. 

The present paper is organized as follows: In section 

II, we investigate whether a distinction between SES ac- 
tivities and AN can be achieved by the ^S'-value alone. 
Furthermore, we examine if SS can recognize the non- 
Markovianity in all the signals investigated. In section 

III, we attempt to shed light on the quantity 6S s huf cal- 
culated in a surrogate (randomized) data set obtained 
by data "shuffling". We find that 5S s huf in ECG is a 
measure of a/fi (where /i and a stand for the mean value 
and the standard deviation of the corresponding inter- 
vals, see below). Section IV shows that the JS^uZ-value 
differs from SS, as expected (cf. the entropy S is not 
static entropyas mentioned above). The prominent role 
of the ratio SS s h u f / SS in distinguishing ECG of healthy 
humans from those suffered from sudden cardiac death 
is shown in Section V. The conclusions are summarized 
in Section VI. Finally, an Appendix is reserved to derive 



an exact relationship between 5S s h u f and a/fj, when Qk 
are iid. 



II. THE POSSIBILITY OF EMPLOYING SS TO 
"RECOGNIZE" THE NON-MARKOVIANITY 

We start by examining whether the JS'-values alone 
can distinguish SES activities from AN as well as "recog- 
nize" their non-Markovianity. Recall[2, 3], that SES and 
AN are time-series of dichotomous nature which are non 
Markovian. In a dichotomous Markovian time-series, the 
dwell times (Qk) are exponentially distributed; for such 
a series we plot, in Fig. 1(a), the SS- value versus the 
time-window length. (Since in the calculation of S only 
ratios of Qk are involved the result does not depend on 
the transition rates of the Markovian process.) The error 
shown in this case is on the average 7%. (The calcula- 
tion was made for a total number of 10 2 pulses, see be- 
low. Note that this error decreases upon increasing the 
number of pulses, i.e., it becomes 2% for 10 3 pulses, 
which will be used later). In the same figure, we insert 
the JS'-values calculated for the four SES activities (la- 
belled Kl, K2, A, U) and the six AN (labelled nl to n6) 
depicted in Fig. 1 of Ref. [3]. An inspection of Fig. 1(a) 
reveals the following conclusions. First, no distinction 
between SES activities and AN (both of which have es- 
timation errors comparable to the aforementioned error 
of the Markovian) is obvious. An inspection of Table I 
of [12], reveals that the number of pulses in three (out 
of the four) SES activities is around 10 2 , for K2, U and 
A (while for Kl, is « 310) and this is why we calculated 
here the Markovian case for 10 2 pulses. Second, concern- 
ing the possibility of "recognizing" the non-Markovianity 
(as discussed and shown in Rcfs. [2-4] by independent 
procedures): This could be possibly supported, only for 
the shorter time- windows (i.e., 3, 4 and possibly 5 pulses) 
for all SES activities as well as for most AN (i.e., n6, n4, 
n3, n2, possibly nl, but not for n5), see Fig. 1(a). 

We now investigate if the ^S'-values alone can "rec- 
ognize" the non-Markovianity in ECG. In a single si- 
nus (normal) cycle of an ECG, the turning points are 
labelled with the letters P, Q, R, S and T. We used 
here the QT database from physiobank [8]( see also [13]), 
which consists of 105 fifteen-minute excerpts of Holter 
recordings as follows: 10 from MIT-BIH Normal Si- 
nus Rhythm Database (i.e., healthy subjects, hereafter 
labelled H), 15 from MIT-BIH Arrhythmia Database 
(MIT), 13 from MIT-BIH Supraventricular Arrhythmia 
Database (MSV), 6 from MIT-BIH ST Change Database 
(MST), 33 from the European ST-T Database (EST), 
4 from MIT-BIH Long-Term ECG Database (LT) and 
24 from sudden death patients from BIH(SD). (cf. BIH 
denotes the Beth Israel Hospital). In Fig. 2, we plot, 
for the QRS-interval time-series, the <5S-value averaged 
over each of the aforementioned seven groups versus the 
time-window length. Since all time-series of these seven 
groups have ~ 10 3 intervals, we insert in the same figure 
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FIG. 1: (Color) (a) The 5S-values for each SES-activity and 
artificial noise versus the time-window length. The corre- 
sponding values for a Markovian time-series (10 2 pulses) are 
also plotted (green), (b) SS versus 5S s huf (time- window range 
3-5) for all the SES activities and AN in (a) . The straight line 
corresponds to 8S s huf = SS. 



the results calculated for a Markovian case (cf. with the 
procedure mentioned in the previous paragraph) of com- 
parable length w 10 3 . We see that the Markovian case 
exhibits SS- values that are roughly one order of magni- 
tude larger than those of the seven groups of humans, 
which clearly points to the non-Markovianity of all the 
signals in these groups. We emphasize that the same 
conclusions are drawn if we consider, instead of QRS-, 
the series of QT-intervals, or the beat-to-beat intervals 
(RR). In summary, the SS- value alone can well recognize 
the non-Markovianity in ECG. 



III. THE PHYSICAL MEANING OF SS sh 
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In Fig. 3(a) we plot, for each of the 105 individuals, 
the value of a/ \i versus the corresponding value of SS s h. u f 
(time-window range 3-10 beats) for the RR-intcrvals. 
The same is repeated in Figs. 3(b) and 3(c) for the QT- 
and QRS-intervals, respectively. All these three plots, 
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FIG. 2: (Color) The (^-values for the QRS-intervals (see the 
text) of the seven groups of humans versus the time-window 
length. The corresponding values for a Markovian time-series 
(10 3 pulses, labelled M) are also plotted. 



can be described by linear behavior and a least squares 
fitting to a straight line passing through the origin leads 
to the following slopes: 38.6 ± 0.6, 36.8 ± 0.2 and 40.1 
± 0.4, for the RR-, QT- and QRS-intervals, respectively. 
This points to the conclusion that SS s } lu f provides, as 
intuitively expected, a measure of o~/[i. (This, however, 
cannot be supported with certainty for the SES activities 
and AN.) Note that, although these three slopes are more 
or less comparable, they differ by amounts lying outside 
their standard error. Furthermore, it may be worthwhile 
to mention, that if we study altogether the RR-, QT- and 
QRS-intervals, for the 10 healthy humans only (Fig. 4), 
a good linearity oia/fi versus 5S s h u f results with a slope 
37.5 ± 0.4. (cf. if we study each of the three intervals 
separately, we find slopes that agree within the error mar- 
gins, i.e., 37.5 ± 0.4, 37.1 ± 0.7 and 37.8 ± 0.1 for the 
RR-, QT- and QRS-intervals, respectively). The origin 
of this common behavior merits further investigation. 

One could argue that Qk may become iid upon their 
shuffling. In the Appendix, we show that, when Qk are 
iid, SS is actually proportional to a/fi; the following re- 
lationship is obtained: 



SSshuf = ^7F=TE(^ ln 
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(2) 



and e denotes, as usually, the base of the natural loga- 
rithms. The relation (1) reveals that SS s h u f versus aj\x 
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must be a straight line with a slope ranging from 34.2 
to 40.4, for a time-window length 3 to 10. This result is 
comparable with the slopes determined above from the 
analysis of the ECG data. 
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FIG. 3: The cr/fl- value, for each of the 105 individuals, versus 
the corresponding 8S s huf-va\ue for the (a) RR-, (b) QT- and 
(c) QRS-intervals. The identity of the individual associated 
with each point can be found in Ref.[26]. 



IV. ON THE DIFFERENCE BETWEEN SS AND 

SSshuf 

We first comment on the difference between SS and 
fiS shu f in the SES activities and AN. In Fig. 1(b), the 
value of SS versus the corresponding 5S s i lu f was plotted, 
for each of the ten signals discussed in Fig. 1(a). The 
average values in Fig. 1(b) have been calculated over the 
three time-windows of 3, 4 and 5 pulses, since we men- 
tioned in Section II that the "recognition" of the non- 
Markovianity in all SES activities becomes possible in 
this time-window range. If we disregard n6, and despite 
the errors of around 5% (for the time- window range 3-5), 
we may say that there is a systematic tendency point- 
ing to a value of SS s huf /SS larger than unity (cf. the 
same conclusion is drawn, if we take the averages over 
the time-window range 3-10). This is consistent with 
the non-Markovianity of all these signals, because for a 
Markovian case we expect SS s huf—SS. (Since, by def- 
inition, SS s huf corresponds to the entropy fluctuations 
upon random mixing of Qk, see Section I, it is naturally 
expected that in a Markovian case the two quantities SS 
and SS s huf should coincide). Note that the reverse is not 
always true (thus the equality SS s huf — SS may also hold 
for non-Markovian time series) as will be demonstrated 
below with precise examples. 

We now proceed to compare SS s huf with SS in ECG. 
Figure 5(a) depicts the (JS'-values, calculated for each of 
the 105 individuals, versus the corresponding SS s h u f for 
the RR-intervals (time- window range 3-10 beats). The 
same is repeated in Figs. 5(b) and 5(c) for the QT- and 
QRS-intervals, respectively. In each case, we also plot 
the straight line SS s huf=SS to visualize that the vast 
majority of points fall below this line. The non-equality 
of SSshuf and SS has been also verified by applying the 
Wilcoxon paired signed-rank test recommended[14] to be 
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FIG. 4: (Color) The a/fjr value for RR-, QT_and QRS- 
intervals of the ten H versus the corresponding SS s huf-va\ue 
(time- window range 3-10 beats). The straight line results 
from a least squares fit of all the thirty points. For the iden- 
tity of the individual associated with each point see Ref. [26]. 



followed for non-Gaussian paired data. The tested null 
hypothesis is that the means of SS s h u f and SS are the 
same and is rejected at a level of significance well below 
0.01, since the data of Figs. 5(a), (b) and (c) lead to nor- 
mally distributed variables z — —8.29, —6.81 and —6.32, 
respectively (cf. the corresponding one-tailed asymp- 
totic significance is given by P{Z < z), i.e., the prob- 
ability to obtain a normally distributed variable which 
is smaller than z). Note that a least squares fit to 
a straight line passing through the origin, results in 
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FIG. 5: The 5S-value, for each of the 105 individuals versus 
the corresponding 5S s huf-value for (a) RR-, (b) QT- and (c) 
QRS-intervals. The straight line, drawn in each case, corre- 
sponds to 8S s huf — SS. For the identity of the individual 
associated with each point see Ref. [26]. 
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FIG. 6: (Color) The lvalue, in each ofthe 10 H (black) and 
24 SD (red), for the QT-intervals versus SSshuf (time- window 
range: 3-10 beats). Note that the values of the ordinates are 
appreciably smaller than the SS- value (~ 2 x 10~ 2 ) of the 
Markovian time-series (10 3 events) depicted in Fig. 2. 



the following expressions: SS — (0.76 ± 0.03)SS s huf, 
SS = (0.85 ± 0.02)5S shuf , 5S = (0.94 ± 0.02)SS shuf for 
the Figs. 5(a), 5(b), 5(c), respectively. The sampling rate 
F s in ECG is 250 Hz; thus, if we take as an example 
the RR-intervals, the experimental error in their alloca- 
tion is around 1/F S . The latter reflects in the calcula- 
tion of SS and SS s h u f errors which are drastically smaller 
than those required to eventually justify a compatibility 
of the expression SS = (0.76 ± 0.03)8S s huf , found from 
Fig. 5(a), with a straight line of slope equal to unity, i.e., 
SS = SS s h u f- 

The difference between SS and SS s huf in ECG could 
be understood in the context that the former depends 
on the sequential order (of beats), as mentioned in Sec- 
tion I, while the latter does not. Since short- (and long-) 
range correlations is a usual feature( see Ref. [15] and 
references therein) in heartbeat dynamics, which are pos- 
sibly destroyed (or become weaker) upon randomizing 
the data, more "disorder" is intuitively expected to ap- 
pear after randomization, thus reflecting SS s huf > SS. 
Furthermore, note that in all the three plots of Fig. 5 
there are some drastic deviations from the straight line 
SS = SS s huf- The origin of these deviations is currently 
investigated in detail. 

Finally, we further clarify the aforementioned point 
that the equality SS — SS s huf does not necessarily reflect 
Markovianity. In Fig. 6, we plot, for the QT-intervals, 
SSshuf versus SS (for time-window range 3-10 beats) 
for SD and H. We see that there are several individuals 
(mainly SD, see also next section) of which their points lie 
practically (i.e., within the error margins) on the straight 
line SS = 5S s huf- If we plot their SS- (or SS s huf-) values 
versus the time-window (in a similar fashion as in Fig. 
2), we find that these values are distinctly smaller than 
those of the Markovian case, thus making clear that these 
individuals cannot be characterized as exhibiting Marko- 
vian behavior. (This non-Markovianity holds for all H 
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and all SD.) 



V. THE USE OF 5S shuf /5S TO DISTINGUISH 
ECG OF HEALTHY HUMANS FROM THE 
SUDDEN CARDIAC DEATH ONES 

Here we focus only on two groups of ECG, namely H 
and SD, and examine whether they can be distinguished 
by means of the ratio SS s h u f /SS. We calculate this ratio, 
for each type of interval, at two ranges: (i) a short (s) 
range 3-4 beats (cf. consider that the smallest number 
allowed for the natural time-domain analysis is 3 beats) 
and (ii) a longer (L) range 50-70 beats. For the sake of 
convenience, we define v = 5S s h u f /SS, and hence the 
following ratios are investigated: v s {t) and vl{t), where 
r denotes the type of interval (i.e., t=RR, QRS or QT) 
and s,L refer to the range studied (i.e., s = 3-4 beats 
and L=50-70 beats). 

The calculated values for v s (r) and (t) for the three 
types of intervals are given, for all H and SD, in Table 
I. The minima miri^ [i/ k (t)] and maxima max#[^ K (r)] 
(where k denotes either the short, n = s, or the longer, 
k = L, range) among the healthy subjects are also in- 
serted in two separate rows, for each type of interval and 
each range studied. These minima and maxima are la- 
beled H min and H max , respectively. The cases of SD 
which have smaller and larger values than H min and 
H m ax (reported in each column) arc marked with su- 
perscripts "a" and "b" respectively. 

A careful inspection of Table I leads to the following 
main conclusion: All SD violate one or more H-limits 
(i.e., they have values that are smaller than H min or 
larger than H max ). We intentionally emphasize that this 
conclusion is also drawn even when disregarding the re- 
sults for the QT-intervals. (Concerning the latter in- 
tervals: Only 5 SD out of 24 violate the H-limits; how- 
ever, in all SD, their (JS-values themselves are larger than 
those in H, see also Fig. 6. The usefulness of this dif- 
ference will be discussed in detail elsewhere). In other 
words, when focusing our investigation solely on the RR- 
and QRS-intervals, all SD violate one or more of the 
four H-limits related to v s (RR), vl{RR), v s (QRS) and 
vl(QRS). This is important from practical point of view, 
because the RR- and QRS-intervals can be detected more 
easily (and accurately) than the QT- by means of an 
automatic threshold based detector (e.g., see Ref. [16], 
which evaluated the results of a detector that has been 
forwarded in Refs. [17] and [18] to determine automati- 
cally the waveform limits in Holter ECG). 

A further inspection of Table I leads to the follow- 
ing additional comment: When investigating the RR- 
intervals alone (which can be detected automatically 
more easily and precisely than the other intervals), i.e., 
studying u s (RR) and vl{RR), the vast majority of SD 
(22 out of 24 cases) can be distinguished from H (only 
two SD, i.e., sel30 and sel47, obey the corresponding H- 
limits). Specifically, concerning v s (RR), 15 SD have val- 



ues smaller than H m in — 1.18, while only one SD (i.e., 
scl43) has a value exceeding H max = 2.25; as for vl(RR), 
18 SD exceed H max = 0.77, while only 2 SD (i.e., sel34 
and sel42) have values smaller than H m in = 0.44. 

In what remains, we proceed to a tentative physi- 
cal interpretation of the above results, the main feature 
of which focuses on the fact that most SD simultane- 
ously have ^ s (i?i?)-values smaller than H m i n (= 1.18) 
and z/j,(ii.R)-values exceeding H max {= 0.77). The RR 
time-series of healthy subjects are characterized by high 
complexity (e.g., [15, 19]); this, if we recall that in a 
Markovian series we intuitively expect SS s h u f/SS = 1 
(and hence v s = 1 and v>l = 1), is compatible with the 
fact that in all H both v s (RR) and vl(RR) distinctly dif- 
fer from unity (see Table I). We now turn to SD by con- 
sidering that for individuals at high risk of sudden death 
the fractal physiological organization (long range corre- 
lations) breaks down and this is often accompanied by 
emergence of uncorrelated randomness (see [15] and ref- 
erences therein). It is therefore naturally expected that 
in SD the values of v s {RR) and vl{RR) become closer 
to the Markovian value (i.e., unity) compared to H; thus, 
in SD, v s (RR) naturally becomes smaller than the value 
1.18 (the corresponding _ff min -limit) and i>l{RR) larger 
than 0.77 (the corresponding H max -\\m\t). 

We now focus on the following important property 
of H: although both v s {RR) and vl(RR) differ from 
unity, as mentioned, they systematically behave differ- 
ently, i.e., v s (RR) > 1 while vl(RR) < 1. The exact 
origin of the latter difference has not yet been identi- 
fied with certainty, but the following comments might 
be relevant: First, in the frame of the frequency-domain 
characteristics of heart rate variability (e.g., [20]), we 
may state that v s (RR) and vl{RR) are associated with 
the high-frequency (HF, 0.15-0.4 Hz) and low-frequency 
(LF, 0.015-0.15 Hz) range in the RR tachogram ("in- 
stantaneous" heart rate,l/RR). An important difference 
on the effect of the sympathetic and parasympathetic 
modulation of the RR- intervals has been noticed (e.g., 
see [20] and references therein): Sympathetic tone is 
believed to influence the LF component whereas both 
sympathetic and parasympathetic activity have an ef- 
fect on the HF component (recall that our results show 
v s {RR) > vl(RR))- Second, at short time scales (high 
frequencies), it has been suggested [21] that we have rel- 
atively smooth heartbeat oscillations associated with res- 
piration (e.g., 15 breaths per minute corresponds to a 
4 sec oscillation with a peak in the power spectrum at 
0.25 Hz, see [20]); this is lost upon randomizing the con- 
secutive intervals Qk, thus probably leading to (larger 
variations -compared to the original experimental data- 
between the durations of consecutive intervals and hence 
to) 5S s huf -values larger than SS, i.e., a v s (RR)-value 
larger than unity (cf. an extension of the current analysis 
to a surrogate sequence for a simultaneous recording of 
the breath rate and the instantaneous heart rate, upon 
considering the points discussed in paragraph 4.6 of Ref. 
[5] , could greatly contribute towards clarifying the valid- 
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TABLE I: The values of the ratios SS s huf /SS in the short (s) range 3-4 (i/ s ) or in the longer (L) range 50-70 beats (vl) in H 
(sel!6265 to sel!7453) and SD (sel30 to sell7152) for the RR-, QRS- and QT-intervals 



individual 


3-4 beats (i/ s ) 






50-70 boats (v L ) 






R R 

1 V 1\ 


or 




RR 
till 




AT 

<4 i 


sell6265 


1.82 


1.00 


1.24 


0.48 


1.02 


0.76 


scll6272 


1.74 


0.99 


0.98 


0.77 


1.08 


1.11 


sell6273 


2.21 


1.00 


1.48 


0.50 


0.88 


0.71 


scll6420 


1.55 


0.98 


1.08 


0.53 


1.09 


0.90 


sell6483 


2.25 


1.02 


1.14 


0.52 


1.16 


0.92 


scll6539 


1.42 


1.06 


1.25 


0.50 


1.08 


0.65 


scll6773 


1.94 


1.00 


0.99 


0.44 


1.05 


0.96 


sell6786 


1.42 


1.00 


1.19 


0.56 


1.04 


0.77 


scll6795 


1.18 


0.98 


1.08 


0.73 


0.96 


0.99 


SC11 i 40O 


l.JO 


i ni 
1 .Ul 






n oq 
U.yo 


n oi 




1.18 


0.98 


0.98 


0.44 


0.88 


0.65 


H max 


2.25 


1.06 


1.48 


0.77 


1.16 


1.11 


sel30 


1.29 


l.ll 6 


1.09 


0.65 


0.72° 


1.09 


sel31 


0.96° 


1.08 6 


1.17 


1.23 6 


0.94 


0.62° 


sel32 


1.39 


1.14 6 


1.12 


1.02 6 


0.69° 


0.90 


sel33 


1.05° 


0.99 


1.00 


0.86 6 


0.82° 


0.99 


scl34 


2.11 


1.29 6 


1.11 


0.42° 


0.78° 


0.67 


sel35 


1.00° 


1.00 


0.96° 


1.01 6 


1.05 


1.08 


sg136 


1.02° 


1.02 


1.04 


0.92 6 


1.00 


0.88 


scl37 


1.07° 


1.18 6 


1.07 


0.55 


0.75° 


0.65 


sel38 


0.99° 


1.09 6 


1.13 


1.37 6 


0.89 


1.04 


sel39 


0.96° 


1.02 


1.06 


2.93 6 


0.92 


0.90 


sel40 


1.01° 


1.00 


0.93° 


0.78 6 


0.93 


1.29 6 


sol41 


1.07° 


1.04 


1.02 


1.07 6 


0.84° 


0.96 


sol42 


1.63 


1.08 6 


1.23 


0.42° 


1.06 


0.67 


scl43 


2.71 6 


l.ll 6 


1.05 


0.56 


0.76° 


0.89 


sel44 


0.91° 


0.95° 


0.88° 


2.24 6 


1.46 6 


1.32 6 


sel45 


0.98° 


1.24 6 


1.29 


0.98 6 


0.86° 


0.79 


sel46 


1.03° 


1.01 


1.03 


1.00 6 


0.84° 


1.01 


scl47 


1.56 


0.97° 


1.03 


0.45 


0.97 


1.01 


scl48 


0.82° 


1.18 6 


1.44 


1.48 6 


0.68° 


0.73 


sel49 


0.93° 


l.ll 6 


0.96° 


1.22 6 


0.70° 


1.14 6 


sel50 


1.05° 


0.98 


0.98 


0.93 6 


1.23 6 


1.50 6 


sel51 


1.25 


1.01 


0.97° 


1.05 6 


1.24 6 


0.91 


scl52 


1.50 


1.16 6 


1.22 


1.00 6 


0.73° 


0.68 


se!17152 


1.64 


1.01 


1.04 


0.90 6 


1.01 


0.97 



"These values are smaller than the minimum (H m i„) value of 
SS a h u f /8S in H for each range 

6 These values are larger than the maximum (H ma x) value of 
5S a h u f /5S in H for each range 



ity of such an explanation). Such an argument, if true, 
cannot be applied, of course, in the longer range 50-70 
beats and hence explain why the opposite behavior, i.e., 
SS s huf < SS, then holds. The latter finding must be in- 
herently connected to the nature itself of the long range 
correlations. The existence of the latter is pointed out 
from the fact that (in this range also) the RR-intervals re- 
sult in (5S'-values (~ 10~ 3 ) which significantly differ from 
the Markovian SS- value (<~ 10~ 2 ) (cf. the existence of 
the long range correlations in the heart rate variability 
has been independently established by several applica- 
tions of the detrended fluctuation analysis, e.g., see [21], 
[15] and references therein). 

A simplified interpretation of the results of Fig. 6, and 
in particular the reason why for the QT intervals the 
quantity SS is larger for the SD than for the H, could 
be attempted if we consider that: (i) S could be thought 
as a measure of the "disorder" (in the consecutive inter- 
vals) (ii) the essence of the natural time-domain analy- 



sis is built on the variation of the durations of consecu- 
tive pulses, and (hi) it has been clinically observed (e.g., 
see Ref. [22] ) that the QT interval (which corresponds to 
the time in which the heart in each beat "recovers" - 
electrically speaking- from the previous excitation) ex- 
hibits frequent prolonged values before cardiac death. 
Thus, when a time-window is sliding on an H-ECG, it 
is intuitively expected to find, more or less, the same 
S'-values (when sweeping through various parts of the 
ECG) and hence a small <5S-value is envisaged. By the 
same token, in an SD-ECG, we expect that, in view of 
the short-long-short sequences of the QT-intervals, the 
corresponding S'-values will be much different (compared 
to H), thus leading to a larger SS- value (cf. in the same 
frame we may also understand why the cr//i values -and 
hence SS s h u f, see Eq. (1)- are larger in SD than those 
in H, as shown in Fig. 6). The distinction between SD 
and H could be also understood in the context of dy- 
namic phase-transitions (critical phenomena), as follows: 
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In SD, since the dynamic phase transition (cardiac ar- 
rest) is approached, the fluctuations of S are expected 
to become larger, thus reflecting larger SS; such intense 
fluctuations are not expected, of course, for H. 



VI. CONCLUSIONS 

The main point emerged in the surrogate data analysis 
presented in this paper, is the key-role of the quantity 
fiS s h u f/5S. This ratio: 

1. reveals the non-Mar kovianity in all three types of 
signals analyzed here, i.e., SES activities, AN and ECG. 
In a Markovian case we have SS s h u f — SS, but the reverse 
is not always valid; it may happen that 5S s h u f /SS = 1, 
although SS- (and SS s h u f-) value drastically differ from 
that of the Markovian (this is the case of ECG) . 

2. differentiates the ECG of healthy humans (H) from 
those suffered from sudden cardiac death (SD). More pre- 
cisely, in SD, the SS s h u f/SS values of the RR (i.e. beat 
to beat) intervals become closer to the Markovian value 
(i.e., unity) compared to those in H. Furthermore, in SD, 
both SS -and 5S s h u f- values of the QT interval ( corre- 
sponding to the time in which the heart "recovers" from 
its previous excitation) are larger than those in H. 

As for the physical meaning of SS s h u f in ECG, it was 
shown to be a measure of a j [i. 



APPENDIX: INTERRELATION BETWEEN 8S shuf 
AND a//i IN THE CASE OF IID 

If we consider a time-series Qk, where Qk > 0, k — 
1, 2, ... N, we obtain the quantities pk = Qk/Y^i=i Qh 
which satisfy the necessary conditions [23]: pk > 
0,J2k=iP k ~ 1 to be considered as point probabili- 
ties. We then define[l-3] the moments of the natu- 
ral time X k = k/N as ( X q ) = Ef=i(W% and 
the entropy S = (x m x) — (x)l n (x): where (%lnx) = 
J2k=i(k/N)^n(k/N)pk- This Appendix is solely focused 
on a uniform distribution in the natural time-domain. 

We now consider the case when Qk are independent 
and identically distributed (iid) positive random vari- 
ables. It then follows that the expectation value E(pk) = 
E [Qfe/EiIi Qi] of Pfe equals 1/N: 



E(p fc ) 



1 

N' 



Equation (A.l) results from the fact that, since Qk are 

iid, we have: Eg^, Qk/ E<=i Ql] = 1 = NE(p k ). 
For the purpose of our calculations the relation be- 
tween the variance of pk, Var(pfe) = E[(pk — 1/N) 2 }, 
and the covariance of pk and pi, Cov(j>k,Pi) — E[{pk — 
1/N)(pi — 1/N)], is of central importance. Using the 
constraint EfeLiPfc = E leading to pk — 1/N — 
E;^fc(V-^ — an d the fact that Qk are iid, we obtain 



E [(pfc — 1/N) 2 ] = E (Pk-l/N^^il/N-pO 



— (N — 1)E [(p fc - l/JV)(pj - 1/N)]. Thus, we get 

n / \ Var(p fe ) 
Cov( P k,pi) = — N _ 1 ■ 

The A-dependence of Var(pk) is obtained from 



Var(p fe ) 



1 



E 



NQ k 



1 



(A.2) 



(A.3) 



where the quantity E[(NQk/ Eili Ql ~ I) 2 ] i s asymp- 
totically N- independent. The latter arises as follows: If 
E(Qfc) = fi and Var(Qfc) = ct 2 (< oo), as a result of the 
central limit theorem[24], we have E(E^Li Qk/N) = n 
and Var(Ef = i Qk/N) = a 2 /N. The latter two equa- 
tions, for large enough N imply that E[(NQk/ Ejli Ql ~ 
l) 2 ] w E[(Q k /p- l) 2 ] = ct 2 /m 2 - Thus, Eq.(A.3) becomes 



Var(p fe ) 



N 2 fi 2 ' 



(A.4) 



We now turn to the statistical properties of (x q ) ■ Using 
Eq.(A.l), we have 



H(x q )] = J2 



fc=i 



i 

N I N' 



(A.5) 



which, since[25] EfcLi fc<? = N q+1 /{q + 1) + N q /2 + 
o(N q ), reveals that E[(x 9 )] is again asymptotically N- 
independent because it approaches the value l/(q + 
1) with a "small" 1/(2N) correction. The variance 
Var[( X ")][= (S( X q )) 2 ] 



Var[(x 9 )] = E 



A' 



,fc=i 



N 



1 

N 



(A.6) 



after expanding the square and using Eqs.(A.2) and 
(A.4), becomes: 



Var[ ( X q ) ] = 



fc=i 



k\ 2q a 2 



N 



N 2 p 2 

N / , \ q N 



(A.l) which, using Eq.(A.5), finally leads to: 



Var[(x 9 )] 



(N - 1)M< 



{E[(x 2q )]-E 2 [( X q )]}. (A.8) 



The proof of Eq.(A.8) can be generalized for all linear 
functionals of pk of the form (/(%)) = Efc=i f(k/N)pk 
and yields: 

v Uf< ttl ^ 2 {E[(/ 2 (X))]-E 2 [(j(x))]} Q . 
Var[(/(x))] = — i {N _ l)y 2 -■ ( A - 9 ) 
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In Fig. (7), we compare the theoretical result of Eq.(A.8) 
with synthetic (Gaussian) data which have values of /i, 
a and size (« 1000) similar to those in ECG. Note that 
when one uses the estimator (5X) 2 = ^(X — X) 2 /N, 
instead of the unbiased estimator (SX) 2 = E(AT ~ 
X) 2 /(N — 1), in order to find the sample variance, N 
should replace N — 1 in Eq.(A.8). 

We now proceed to the statistical properties of the 
entropy [1, 3] S = (x^O-X) — (x)ln(x)- The expectation 
value 



E(5) = E 



N 



V — In ( — 



N 



^-E^ME^ 

./c=i ■ ■ ■ ' fe=i \/=i / 

(A.10) 

can be evaluated as follows: we add and subtract the 
term J2k=i j;P k ^ n Ej=i (w) ~k > an ^ tnen ex P an d the 

resulting term ln[l + Y^Li TT&l ~ 7?) I Ya=i w\ to first 
order in (p; — jf); finally, using Eq.(A.8), we obtain 



N' 
N 



N 2 



hi 



JV 

E 

k=l 



N 2 



In 



^ 2 [Ef =1 k 2 /N* - (Ef =1 k/N 




(A.ll) 



This equation reveals that E(S') depends slightly on aj [i\ 
upon increasing N the last term of Eq.(A.ll) decays as 
l/N (cf. the sums in the numerator and the denomina- 
tor are of the form E[(x q }], for gr=l and 2, and asymp- 
totically lead to a constant l/(q + 1), see the relevant 
discussion after Eq.(A.5)). 



To simplify the calculation of the variance of the en- 
tropy Var(S'), we define the two linear functionals 



N 

E 

fc=i 



m[x k \ = 2^ jjX k , 



(A.12) 



N 



=£^k(^)**> (A.13) 



and the constant time-series IK = {x k } : x k — l/N, k = 
1,2,...N. Note that for both functionals m[xk] and 
L[xk,£], in view of their linearity, we have 



E \ m[p k - 1]| = E U[p k = D- (A.I -I ) 



Using Eqs.(A.12) and (A.13) the entropy can be written, 
in compact form, as follows: 





0.16 r 




0.14 




0.12 




0.1 






o 


0.08 - 


03 




3 
-t— • 

o 


0.06 


3 
LL 


0.04 




0.02 










5<x> 






SS « 








S = L \pk,m\p k ]] , 






and its expectation value is written as 







(A.15) 



10 20 30 40 50 60 70 80 90 100 
Number of beats 



2 

E(S) = L[K, l]-m[K] lnmpK]- ° ^ (A.16) 



(N — l)/i 2 m[K] ' 



FIG. 7: (Color) Comparison of the theoretical estimations 
(solid lines) of S(x) and SS resulting from Eqs.(A.8) and 
(A. 21), respectively, with the values obtained (plus and cross, 
respectively) using a Gaussian sample having values of fi, a 
and size (~ 1000) similar to those in ECG. Here, as well as 
throughout the paper, the estimator (SX) 2 = ^2(X — X) 2 /N 
was used for the calculation of the sample variance of the syn- 
thetic data, and thus N — 1 was replaced by TV in Eqs.(A.8) 
and (A.21). 



where k 1jU = E[( X 2 )] - E 2 [( X )]. 

The variance of the entropy Var(S) = (SS) 2 can 
then be found by adding and subtracting the terms 
m[pfc] lnm[K] and m[p k — l/N] and using the expansion 
m [Pk] In i^prr = m[p k }m[p k - l/N]/m[K\; this gives: 
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Var(S*) = E 



E 



= E 



E 



E 



L[p k , 1] - m[pk] \nm[pk] - L[K, 1] + m[K] lnmf 



(N - 1)^ 2 to[K] 



1 TYL \pk 1 1 

L\p k --,!]- mb,] In ^ + m[- - „*] lnm[K] + ^ _ 



L 



L 



L 



Pk - -^,m[K] 



Pk - jj,m\K] 



Pk - — ,m[K]e 



■m[p k ]m[p k - jf 
m[K] 



+ 



<J 2 Ki 



(N - l)/i 2 m[ 



r 1, ™ 2 bfe-^] 

iv m[K\ 
m2 [Pk~j]} , cr 2 Ki, u 



2 

(7 Ki 



(A - l)M 2 m[K] 



m[K] (A- l) M 2 m[]K] 

Expanding the square in Eq.(A.17), and using Eq.(A.14), we find 



(A.17) 



Var(S) = E L 2 



1 r ' 

Pk - — ,m[KJe 



1 



Pk - — ,m[Jkje 



+ 



2L 

m \Pk - jf] 



m2 iPk - j?] 
m[K] 



4 2 
UK 



TO" 



(TV - 1)Vto 2 [K] 



(A.18) 



//we assume that the distribution of Q k is skewnessless, i.e. E[(Qfc — /i) 3 ] = 0, the expectation value of the second 
term in Eq.(A.18) vanishes, whereas the third and the fourth terms arc of order 1/N 2 and hence negligible with 
respect to the first term. Thus, 



Var(S) = E ( L 2 
which can be explicitly written as follows 



Pk - —,m[K]e 



Var(S) = E 



N i 

V — In 

^-J N \m\K]Ne 



Pk ~N 



(A.19) 



(A.20) 



The right side of Eq.(A.20) becomes similar to Eq.(A.6), if we replace x q by x m (x/ m M e )i thus after expanding the 
square and using Eqs.(A.2) and (A. 4), we finally obtain 



Var(S) 



(N — l)/i 2 



N 



Nk 



k=i 



1 ( N k 



Nk 



(A.21) 



A comparison of Eqs.(A.19) and (A. 15) reveals the fol- and then directly take its variance according to Eq. (A. 9). 
lowing: in order to find the entropy fluctuation SS, one Equation (A.21) is just Eq.(l) of the main text, 
simply has to replace in Eq. (A. 15) m[p k ] with m[K]e 
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